Data-Driven UBM Generation via Tied Gaussians for GMM-Supervector Based Accent Identification
Abstract
This paper presents a new approach to data-driven universal background model (UBM) generation using tied Gaussians for accent identification (AID). The proposed algorithm is motivated by the possibility of capturing broad phonetic-specific accent characteristics with a Gaussian mixture model (GMM), and it examines data-driven, phonetically inspired UBM creation for GMM-supervector based accent classification. We discuss the issues involved in applying Gaussian selection based on cumulative posterior probability and UBM parameter estimation based on a tree structure. The derivation and validation of the UBM refined by tied Gaussians are reported. Performance evaluations comparing our system with other well-known AID techniques are also provided, and fusing these acoustic-based accent classifiers improves performance further. Comparison experiments conducted on the CSLU foreign-accented English (FAE) dataset show the effectiveness of the proposed method.
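For readers unfamiliar with the GMM-supervector representation that the abstract builds on, the following is a minimal sketch of the commonly used relevance-MAP mean adaptation of a diagonal-covariance UBM, followed by stacking the adapted means into one vector. It is not the tied-Gaussian UBM construction proposed in the paper. The function names, the relevance factor of 16, and the scaling by sqrt(weight) / sqrt(variance) are illustrative assumptions drawn from general practice, not from this abstract.

```python
import numpy as np

def posteriors(X, weights, means, variances):
    """Component posteriors for each frame under a diagonal-covariance GMM.
    X: (T, D); weights: (K,); means, variances: (K, D). Returns (T, K)."""
    diff = X[:, None, :] - means[None, :, :]
    log_lik = -0.5 * np.sum(diff ** 2 / variances[None, :, :], axis=2)
    log_lik += -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)[None, :]
    log_p = np.log(weights)[None, :] + log_lik
    log_p -= log_p.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)

def map_adapt_means(X, weights, means, variances, relevance=16.0):
    """Relevance-MAP adaptation of the UBM means to one utterance.
    The relevance factor is an assumed, commonly used default."""
    gamma = posteriors(X, weights, means, variances)
    n = gamma.sum(axis=0)                                  # soft counts, (K,)
    data_mean = (gamma.T @ X) / np.maximum(n, 1e-10)[:, None]
    alpha = (n / (n + relevance))[:, None]                 # adaptation weight
    return alpha * data_mean + (1.0 - alpha) * means

def supervector(X, weights, means, variances, relevance=16.0):
    """Stack the adapted means into a single (K * D,) vector.
    The sqrt(weight) / sqrt(variance) scaling is an assumed normalisation."""
    adapted = map_adapt_means(X, weights, means, variances, relevance)
    scaled = np.sqrt(weights)[:, None] * adapted / np.sqrt(variances)
    return scaled.ravel()
```

In a typical pipeline of this kind, one such vector is computed per utterance and passed to a discriminative classifier such as an SVM; the choice of classifier here is likewise an assumption rather than something stated in the abstract.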
Similar resources
Evaluation and Assessment of Speech Intelligibility on Pathologic Voices Based upon Acoustic Speaker Models
We describe a GMM-UBM-based evaluation system for pathologic voices that uses standard cepstral features. Per speaker one GMM is created and its components are used to create a so-called GMM supervector. The supervector of each speaker is labeled with the intelligibility values obtained by human evaluation and is used to train an SVR. We studied different GMM supervectors containing different G...
Foreign accent detection from spoken Finnish using i-vectors
I-vector based recognition is a well-established technique in state-of-the-art speaker and language recognition but its use in dialect and accent classification has received less attention. We present an experimental study of i-vector based dialect classification, with a special focus on foreign accent detection from spoken Finnish. Using the CallFriend corpus, we first study how recognition ...
Swiss French Regional Accent Identification
In this paper an attempt is made to automatically recognize the speaker’s accent among regional Swiss French accents from four different regions of Switzerland, i.e. Geneva (GE), Martigny (MA), Neuchâtel (NE) and Nyon (NY). To achieve this goal, we rely on a generative probabilistic framework for classification based on Gaussian mixture modelling (GMM). Two different GMM-based algorithms are in...
Noise Compensation for Speech Recognition Using Subspace Gaussian Mixture Models
In this paper, we address the problem of additive noise, which substantially degrades the performance of speech recognition systems. We propose a cepstral denoising based on the Subspace Gaussian Mixture Models paradigm (SGMM). The acoustic space is modeled by using a UBM-GMM. Each phoneme is modeled by a GMM derived from the UBM. The concatenation of the means of a given GMM leads to a very high...
Dialect recognition using a phone-GMM-supervector-based SVM kernel
In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis that certain phones are realized differently across dialects. Given a speaker’s utterance, we first obtain the most likely phone sequence using a phone recognizer. We then extract GMM Supervectors for each phone instance. Using these vectors, we design a kernel function that computes the similaritie...